Quantifying the Utility of the Past in Mining Large

نویسنده

  • Vikram Pudi
چکیده

| Incremental mining algorithms that can eeciently derive the current mining output by utilizing previous mining results are attractive to business organizations since data mining is typically a resource-intensive recurring activity. In this paper, we present the DELTA algorithm for the robust and eecient incremental mining of association rules on large market basket databases. DELTA guarantees eeciency by ensuring that, for any dataset, at most three passes over the increment and one pass over the previous database are required to generate the desired rules. Further, it handles \multi-support" environments where the support requirements for the current mining diier from those used in the previous mining, a feature in tune with the exploratory nature of the mining process. We present a performance evaluation of DELTA on large databases over a range of increment sizes and data distributions, as well as change in support requirements. The experimental results show that DELTA can provide signiicant improvements in execution times over previously proposed incremental algorithms in all these environments. In fact, for many workloads, its performance is close to that achieved by an optimal, but practically infeasible, algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

High Fuzzy Utility Based Frequent Patterns Mining Approach for Mobile Web Services Sequences

Nowadays high fuzzy utility based pattern mining is an emerging topic in data mining. It refers to discover all patterns having a high utility meeting a user-specified minimum high utility threshold. It comprises extracting patterns which are highly accessed in mobile web service sequences. Different from the traditional fuzzy approach, high fuzzy utility mining considers not only counts of mob...

متن کامل

A New Algorithm for High Average-utility Itemset Mining

High utility itemset mining (HUIM) is a new emerging field in data mining which has gained growing interest due to its various applications. The goal of this problem is to discover all itemsets whose utility exceeds minimum threshold. The basic HUIM problem does not consider length of itemsets in its utility measurement and utility values tend to become higher for itemsets containing more items...

متن کامل

Restoring the past glory of Diamond Mining in south India- A plausible case of diamondiferous Wajrakarur kimberlite pipe clusters with geochemical evidences

A plausible case of collective and economical mining of diamondiferous kimberlite deposits of Wajrakarur and adjoining places in Andhra Pradesh, southern India along with the whole-rock geochemical evidences in support of their diamond potentiality are discussed in this article. The kimberlites/lamproites are mantle-derived ultrabasic rocks which rarely carry diamonds from mantle to the earth’s...

متن کامل

Data sanitization in association rule mining based on impact factor

Data sanitization is a process that is used to promote the sharing of transactional databases among organizations and businesses, it alleviates concerns for individuals and organizations regarding the disclosure of sensitive patterns. It transforms the source database into a released database so that counterparts cannot discover the sensitive patterns and so data confidentiality is preserved ag...

متن کامل

3D model construction of induced polarization and resistivity data with quantifying uncertainties using geostatistical methods and drilling (Case study: Madan Bozorg, Iran)

Madan Bozorg is an active copper mine located in NE Iran, which is a part of the very wide copper mineralization zone named Miami-Sabzevar copper belt. The main goal of this research work is the 3D model construction of the induced polarization (IP) and resistivity (Rs) data with quantifying the uncertainties using geostatistical methods and drilling. Four profiles were designed and surveyed us...

متن کامل

Choosing the Optimum Underground Mine Layout with Regard to Metal Price Uncertainty Using Expected Utility Theory

Metal price is one of the most important parameters in the calculation of cut- off grade. The cut- off grade has the main role in determination of mine layout. Mine layout actuates mineable reserve, mine life and economic profitability. Not considering the uncertainty in metal prices can lead to a non-optimal layout. In this paper optimum underground mine layout is determined by expected utilit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2000